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ABSTRACT 

The human telomeric DNA sequence with four 
repeats can fold into a parallel-stranded propeller- 
type topology. NMR structures solved under 
molecular crowding experiments correlate with the 
crystal structures found with crystal-packing inter- 
actions that are effectively equivalent to molecular 
crowding. This topology has been used for rational- 
ization of ligand design and occurs experimentally in 
a number of complexes with a diversity of ligands, at 
least in the crystalline state. Although G-quartet 
stems have been well characterized, the inter- 
actions of the TTA loop with the G-quartets are 
much less defined. To better understand the con- 
formational variability and structural dynamics of 
the propeller-type topology, we performed molecu- 
lar dynamics simulations in explicit solvent up to 
1.5 ns. The analysis provides a detailed atomistic 
account of the dynamic nature of the TTA loops 
highlighting their interactions with the G-quartets 
including formation of an A:A base pair, triad, 
pentad and hexad. The results present a threshold 
in quadruplex simulations, with regards to under- 
standing the flexible nature of the sugar-phosphate 
backbone in formation of unusual architecture 
within the topology. Furthermore, this study 
stresses the importance of simulation time in 
sampling conformational space for this topology. 



INTRODUCTION 

Telomeric DNA, containing guanine rich tandem repeats, 
is found at the end of all eukaryotic species, and is 
associated with a number of specialized proteins, 
forming the telomere. Its function is to protect chromo- 
somal DNA from degradation, end-to-end fusions and 
recombination (1). Telomeres shorten progressively due 
to the end-replication problem, a consequence of the 
inability of DNA polymerases to fully replicate the 
Spends of chromosomal DNA (2,3). Human telomeric 
DNA consists of repeats of the hexanucleotide sequence 
d(TTAGGG) n . A minimum of four repeats can fold into 
four-stranded intramolecular structures termed G-quad- 
ruplexes (4). These structures have been shown to play a 
role in telomere regulation and therefore have been 
proposed as a potential target for therapeutic intervention 
by small molecule ligands (4,5). 

The folding and formation of human G-quadruplex 
structures have been studied by a variety of biophysical 
and chemical probe methods (6-9). The central unit of 
G-quadruplexes is a hydrogen-bonded array of four 
guanine bases stacked on top of another to form the 
G-quartet stem. The highly electronegative channel 
running along the axis of the stem is stabilized by mono- 
valent cations (10). The TTA linker sequence is arranged 
diagonally, external to the helical-like stacks of the 
G-quartet stem, forming the loops. At least six intramo- 
lecular G-quadruplex structures with different topologies, 
formed from the human telomeric sequence, have been 
reported to date, including the basket, chair, 3 + 1 
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hybrid and propeller form (11,12). The elucidation of 
NMR solution structures reveals that the sequence can 
switch between parallel, antiparallel and hybrid conform- 
ations depending on the precise sequence, metal ions, 
solvent conditions, oligonucleotide concentration and 
possibly other factors (13). 

The 22-mer sequence, d[AG3(T 2 AG 3 )3] adopts a distinct 
topology in concentrated K + solution, as observed in the 
crystal structure (PDB id:lKFl) (10). The loops are in an 
open conformation in a propeller-like arrangement, con- 
sistent with the folding of a parallel-stranded 
G-quadruplex. The crystal structure contains three 
loops: Loopl (Thy5-Thy6-Ade7), Loop2 (Thyll-Thyl2- 
Adel3) and Loop3 (Thyl7-Thyl8-Adel9), which link 
the bottom (3'-end) and the top (5'-end) of the 
three-stacked G-quartets. The adenine in each TTA 
sequence is swung back forming a Thy-Ade-Thy stack 
(10). Recently, experiments carried out under molecular 
crowding conditions have demonstrated that the 22-mer 
sequence also adopts a similar propeller-type topology in 
the (non-crystalline) solution state (12). The environment 
inside a cell is molecularly crowded owing to macromol- 
ecular concentration of up to 40% (w/v) (14). Therefore, it 
can be speculated that human telomere quadruplexes can 
adopt a propeller-type topology under physiological con- 
ditions. The biological relevance of this topology can be 
established from the reports that it greatly decreases 
telomerase activity and processivity and therefore is a po- 
tential target for anti-cancer therapy (15). Furthermore, 
all G-quadruplex structures co-crystallized in combination 
with small molecule ligands/drugs also show propeller- 
type topology (16,17). This is unlike the other forms 
observed by NMR methods for native telomeric se- 
quences, suggesting that the preferred favoured topology 
for ligand binding is the intramolecular parallel-stranded 
propeller-type (18). In the light of these findings, we have 
used this topology in the present molecular dynamics 
(MD) simulation and aim to provide, for the first time, 
an extended analysis of quadruplex loop structural 
dynamics on a microsecond timescale. 

The MD simulations approach has provided much in- 
formation regarding the dynamic behaviour of DNA in 
solution, and an increasing number of studies have used 
this approach with quadruplex nucleic acids (12,19-21). 
Although X-ray crystallography provides detailed 
atomic-level information of a quasi-static state, general- 
ization cannot be made that crystal structures necessarily 
represent the major conformations. The X-ray structure of 
the propeller-type quadruplex reveals that although the 
dihedral angles for G-quartets are constrained in the 
range of B-DNA, the dihedral angles for loops vary 
from A-, B- and Z-DNA forms (22). The sugar-phosphate 
backbone has considerable potential for conformational 
variability within the quadruplex. Thus, although the 
G-quartet stem is considered to be well characterized by 
experimental and theoretical approaches, conformational 
plasticity and structural dynamics of the single-stranded 
loops as well as interactions of loops with the G-quartets 
are much less understood. As loops are likely dynamical, 
static structures alone cannot provide details of conform- 
ational transitions occurring within a quadruplex. Our 



earlier 15 ns MD simulations of the same sequence con- 
firmed that the G-quartet stem is a highly stable structure. 
The simulations revealed some flexibility in the loop 
region in line with other studies (20,23). However, no 
interactions of loops with the G-quartet stem were 
noted, and it is likely that sampling of the conformational 
space of the loops was not exhaustive (20). Owing to the 
recent improvements in force fields and wide availability 
of high-end processing power, microsecond (jis) long 
simulations of nucleic acids are becoming affordable and 
often dramatically change the picture obtained by shorter 
simulations and older force fields (24-28). 

In this study, we present a detailed analysis of a con- 
tinuous 1.5 |is long MD simulation of a quadruplex 
formed from the human 22-mer telomeric repeat. The 
study focuses on the conformational sampling and the 
dynamic nature of the connecting loops. The folding of 
the loops external to the G-quartet stem are of particular 
interest, as they present an interface for interactions with 
proteins, other nucleic acids and ligands. In addition, the 
topology and the conformation adopted by the loops have 
profound consequences for small-molecule drug design 
and development. The present simulation represents a 
threshold in G-quadruplex simulations, with regards to 
testing the force field for simulating four-stranded struc- 
tures, the role of cations, dynamic flexibility and structural 
variability of parallel stranded propeller-type topology 
G-quadruplexes. 

MATERIALS AND METHODS 

MD simulations 

The crystal structure of the parallel stranded 
propeller-type quadruplex formed from the human telo- 
meric sequence d[AG3(T 2 AG 3 )3] (PDB id:lKFl) was used 
as the starting structure for the simulation (10). The X-ray 
structure contains a vertical alignment of K + ions along 
the axis within the electronegative core of the structure. 
The ions, arranged in a square antiprismatic coordination, 
are sandwiched between the quartets and were retained in 
positions as in the crystal structure. The MD simulations 
were carried out using AMBER11 software using the 
ParmbscO version of the Cornell et al. force field 
(29-31). The structure was explicitly solvated with 
TIP3P water molecules in a periodic box 
(65 A x 64 A x 68 A) whose boundaries extended at least 
10 A from any solute atom. Additional 19 K + counter ions 
were added to the system to neutralize the charge on the 
quadruplex backbone, whose parameters were adapted 
from Hazel et al. (32). 

The protocol for energy minimization and MD in 
explicit solvent was adopted from Haider et al. (20). The 
initial round of equilibration with explicit solvent and ions 
involved 1000 steps of steepest descent, followed by 1000 
steps of conjugate gradient energy minimization. A 300 ps 
MD equilibration was performed in which the quadruplex 
was constrained, whereas the solvent and ions were 
allowed to equilibrate. The system was gently heated 
from OK to 300 K with a time step of 0.5 ps. This was 
followed by subsequent rounds of MD simulation, at 



Nucleic Acids Research, 2013, Vol. 41, No. 4 2725 



constant pressure and 300 K for 200 ps. The constraints 
were gradually relaxed, until no constraints were applied 
to the system. The final MD production run was per- 
formed at 300 K using the particle mesh ewald summation 
method with a charge grid spacing of ~1.0A (33). 
A cut-off of 10 A for non-bonded Lennard-Jones inter- 
actions was used, and the non-bonded pair list updated 
every 20 steps. The SHAKE algorithm was used to con- 
strain hydrogen atoms with a tolerance of 0.0005 A and a 
2fs time step (34). The final production run was carried 
out without any restraints on the system for a total of 
1.5 jis. Frames collected every 2 ps made up the trajectory 
and were analysed using the ptraj module in Ambertools 
1.4 (35). The simulation were run on GPU clusters and 
lasted ~45 days. 

Clustering 

Clustering analysis can detect and classify objects, 
described by structural data, into different groups based 
on structural similarity (36). To identify structural 
clusters from a trajectory, a root mean squared deviation 
(RMSD) parameter was used, where the pairwise distances 
measured as coordinates between structures are defined 
by a cut-off value reflecting the range of conformations 
and their relative populations. The algorithm generates 
centroids describing each cluster and then gives an 
RMSD for each structure in the trajectory with respect 
to each identified cluster. Clustering was carried out after 
removal of the first 300 ns, which were attributed to the 
equilibration phase. The frames were extracted at a time 
interval of 6ps yielding 50 000 frames. The o MMTSB 
toolkit code (29) was used with a cut-off of 3.0 A. 

Data representation 

The backbone angles a, (3, y, s, 5, £ and the glycosidic 
torsion angle x were monitored using the ptraj module 
of AMBER (35). Deoxyribose puckering pseudorotation 
angles (p) and amplitudes (A) have been determined fol- 
lowing the definitions of Altona and Sundaralingam (37), 
using the same reference state for P = 0.0 degrees. The 
torsion, pseudorotation angles and amplitudes were ex- 
tracted from the AMBER trajectory and plotted using 
MATLAB ®(2012a,The Mathworks City, USA). The 
structural illustrations were prepared using VIDA (open 
eye scientific software, Santa Fe, New Mexico), VMD (38) 
and PyMol (http://www.pymol.org) programs. The clus- 
tering data were plotted using the Xmgrace program 
(http://plasma-gate.weizmann.ac.il/Grace). 

RESULTS 

Structural dynamics on a microsecond timescale differs 
dramatically from shorter simulations 

The structural integrity of the quadruplex persisted during 
the complete course of the lengthy 1.5 jis of MD simula- 
tion. To analyse the importance of the timescale on which 
the propeller-type structures equilibrate and vary, the 
pairwise RMSD were assessed over both short and long 
simulation times (Figure 1). Most of the simulations of 



G-quadruplex systems published to date were performed 
on a time scale from 1 to 100 ns (12,20,23,39-42). One of 
the principal results of our work is that such simulations 
are insufficient to properly characterize structural 
dynamics of the G-quadruplex loops. As shown in 
Figure 1, different timescales provide dramatically differ- 
ent qualitative description of the structural dynamics. In 
fact, we show that the first ~300 ns of our simulation give 
a misleading picture of the loop structural dynamics. Only 
after this period does the system settle down. In addition, 
although the loops establish few characteristic stable inter- 
actions (mainly Adel-Adel3 and Thyl7-Gual5, see later 
in the text), they sample a rich ensemble of transient 
geometries. The loop geometry thus cannot be 
characterized by a single structure. Examination of the 
trajectory in short bins reveals that the structure can 
adopt stable conformations that can last several nanosec- 
onds. Although such structures might be easily interpreted 
as 'stable' in short simulations, considerisng them in the 
context of a long (is or longer simulation, these metastable 
structures are transient, and some of them can be only 
marginally populated in sufficiently long simulations. We 
have therefore treated the first 300 ns as equilibration and 
carried out all analyses (except when explicitly stated 
otherwise) presented later in the text on the trajectory 
portion beyond this point. 

The average RMSD of the all-atom structure (4.5 A) is 
significantly greater than that of the backbone (3.4 A) and 
can be attributed to the wobbling effect of bases and the 
flexible nature of the TTA loops (Supplementary Figure 
SI). The stacked G-quartet stem is the most stable 
sub-segment of the structure. To highlight regions of 
stability and similarity between the crystal and the 
simulated structure via MD, wire figures coloured by tem- 
perature factors and RMSF have been illustrated in 
Figure 2. The stable (blue) central G-quartets are 
surrounded by more mobile TTA loops (red). Plotting 
RMSF values, calculated from the simulation and 
comparing with the B-factors in the crystal structure, 
highlights similar thermal fluctuations. 

The three loops show non-equivalent dynamics 

Following the flexibility of the loop structure during the 
course of the simulation and comparing them with the 
X-ray structure reveals significant conformational 
changes. However, the loop conformational rearrange- 
ment did not have any great impact on the structure of 
the central G-quartet stem. The RMSF analysis indicates 
that the mobility of each of the simulated loops is inde- 
pendent of one another (Figure 2c and d). Although the 
three loops might appear at first sight to be equivalent, 
they occur in different structural contexts, which make 
their dynamics distinct. The most important factor 
leading to non-equivalence of the loop dynamics is the 
flanking Adel nucleotide. Adel3 from loop2 forms 
hydrogen bonds with the flanking Adel and end caps 
the quadruplex at one end. This interaction once formed 
is retained throughout the simulation. The stable con- 
formation of Adel 3 on top of the terminal quartet 
stretches loop2, and as a result of this arrangement, the 
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Figure 1. Conformational dynamics of human parallel stranded propeller-type quadruplex DNA on different timescale assessed by pairwise RMSD 
matrices. The minimum and maximum was assigned to a linear colour bar to indicate the variation in RMSD. Owing to large sample size, the 
trajectory for (a) 0-10 ns was sampled at 5t = 1 ps; (b) 0-100 ns was sampled at 5t = 10 ps; (c) 0-1000 ns was sampled at St = 100 ps; (d) 0-1 500 ns 
was sampled at St = 150ps. The matrix of 10 ns trajectory shows relaxation of coordinates after first ~2ns. The high RMSD (yellow stripes) in 100 ns 
matrix showed that the trajectory in this duration was still in equilibration. It is evident from matrix (c and d) that the trajectory equilibrates 
at ~300ns. A large conformational transition observed at ~720ns is owing to the formation of a transient triad. 



Thyl 1-Adel3-Thyl2 stacking conformation, as observed 
in the crystal structure, is permanently lost. Bases from the 
other two loops sample an ensemble of transient loop 
geometries, which can be characteristically stable for 
several nanoseconds. For instance, the first thymines in 
loops 1 and 3 (Thy 5 and Thy 17) dynamically interact 
with the middle quartet to form pentad and hexad archi- 
tecture (see later in the text). Moreover, Ade7 also rotates 
around the /-torsion angle to interact via tt-tt stacking 
with Thy 6 and via edge-to-face with Thy 5. We also 
observe in the simulation that the loops interact with the 
quartets to adopt distinct global geometries, and long 
simulation times are required to explore the conform- 
ational space appropriately. 

Sugar pucker and the backbone angles 

The furanose ring in nucleic acids is conformationally 
flexible (43). The backbone torsion angles and sugar con- 
formations in DNA arrange to present an interface for 



proteins and other molecules to interact (44). The 
backbone conformation of the quadruplex has been 
analysed based on the following degrees of freedom: 
X angle, sugar puckering, oc/y and e/£ crankshaft 
motions. The x angle describes the relative glycosyl orien- 
tation and is in either anti (180°-360°) or syn (0°-180°) 
conformation. The sugar pucker describes the extent to by 
which the furanose ring in nucleic acids deviates from a 
plane. The oc and y angles occupy trans (t) and gauche 
(g) ± regions, and oc/y sampling is useful to validate the 
force fields. For example, the oc/y crankshaft motions in- 
definitely trapped in non-canonical regions of oc/y con- 
formational space lead to force-field-dependent artefacts 
(45). The nucleic acid backbone is considered to be in the 
B x form when the c and £ difference is negative and in the 
B n form when it is positive (46). The above-described con- 
formational classes are typically discussed for B-DNA 
duplexes, but they also have relevance to the quadruplex 
backbone. A summary of conformational variability of all 
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Figure 2. Thermal mobility in the parallel stranded propeller-type quadruplex. A comparison of atomic fluctuations in the (a) crystal structure and 
(b) MD simulation is represented as wireframe. The mobility distribution was calculated using B-factors from the crystal structure (PDB id 1KF1) 
and averaged RMSF from the MD simulation. The stable central G-quartets (blue) are surrounded by more mobile loops (red). This is also 
illustrated in (c). The mobility of the loops is independent of each other, (d) Loopl (red) and Loop2 (green) exhibit higher mobility than the 
Loop3 (blue). 



nucleotides is presented in Figure 3, and descriptive 
analysis is presented in the Supplementary section. 

Terminal capping by A:A pair and A:A:A triad 

The 5' adenine (Adel) in the crystal structure protrudes 
away from the core into the solvent (Figure 2a). It adopts 
an anti conformation and does not interact with any other 
part of the quadruplex. In our previous work, we had 
removed this nucleotide assuming its conformation to be 
a crystallization artefact that might affect the stability of 
the quadruplex (20). An NMR solution structure of a 
parallel-stranded quadruplex formed from human telo- 
meric DNA sequence has subsequently been determined 
under molecular crowding conditions (12). It adopts the 
same fold as the X-ray structure; however, the 5' adenine 
is stacked on the adjacent terminal G-quartet. Based on 
this observation, we decided to retain Adel in the starting 
structure. Within the first few nanoseconds of the produc- 
tion run, the glycosidic x angle of Adel changed from anti 
to syn conformation. This leads to formation of an intra- 
molecular hydrogen bond between the 5 ; -OH group and 
N3. An analogous hydrogen bond has been observed in 
experimental structures that have a guanine nucleotide at 
5' termini in a syn conformation (47). Such an intramo- 
lecular hydrogen bond has also been shown to strongly 
stabilize the syn conformation of the first guanine in the 
G-quartet stems in the absence of any upstream nucleotide 



(48). The x angle switches back to an anti conformation 
(~470ns), which is accompanied by a change in sugar 
pucker from Cl'-endo to CY-exo. The /-angle then oscil- 
lates between anti and syn positions during the latter 
course of the simulation (Figure 4a). The flexibility of 
the sugar phosphate backbone allows Adel to form tt-tt 
stacking interactions with guanines on the terminal 
quartet. Stacking is observed with the adjacent Gua2 or 
across with the neighbouring Gua20. The hydrogen bond 
between S'-OH-W is lost when Adel is in anti conform- 
ation. We observe that owing to stochastic backbone 
movements, Adel can flip back into the solvent. 
However, this is an unfavourable and only transient 
event, as Adel subsequently flips back to stack on the 
G-quartet stem. Adel 3 also moves to the top of the 
quartet to form reverse Watson-Crick hydrogen bonds 
with Adel (Figure 4d). The syn conformation of Adel 
and anti conformation of Adel 3 facilitates head-to-head 
base pairing between Adel and Adel 3 (Supplementary 
Figure SlOa). Ade7 from Loopl also comes in-plane 
with the Adel:Adel3 pair to form an A: A: A triad 
(Figure 3e). It bonds to Adel by side-by-side 
trans-sheared A:A pairing, which appears to be the only 
plausible configuration for the A:A:A structure observed 
in the simulation (49). Quadruplex loops are highly 
flexible structures, and the Ti-stacking interactions of 
Ade7 with the first quartet are accompanied by 
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Figure 3. Conformational variability in the backbone of the parallel stranded propeller-type quadruplex. The arrows indicate similar values for 
different bases. 



stereochemical^ permissible distortions in the backbone 
angles of the loop residues. To achieve the chain 
turn-back, the y torsion angle changes from g+ to trans, 
allowing the sheared A:A bonding to take place 
(Supplementary Figure S7) (50). 

Formation of a pentad and hexad 

Analysis of the trajectory also reveals that Thy5 and 
Thy 17 interact with the quartets resulting in diverse 
quadruplex geometries. These thymine bases bury their 
methyl groups into the groove, as the 04 and imino 
hydrogen atoms of Thy5 and Thy 17 hydrogen bond 
with the N2 amino hydrogen and N3 atom of Gua3 and 
Gual5 (Figure 5a and b). However, a pentad and hexad is 
only formed when both 04 and N3 atoms of Thy5/Thyl7 
are hydrogen bonded to a guanine from the middle 
quartet. These interactions ensure that the thymine bases 



are in plane with the quartet. In the crystal structure, 
Thy 17 forms n-n interactions with Adel9. Early during 
the simulation, the Ti-stack with Thy 17 is lost, and 
Adel9 forms equivalent interactions with Thy 18. Thy 17 
moves in-plane with the central quartet owing to changes 
in the backbone torsion angles, to form a pentad 
(Figure 5c). The first indication of Thy 5 and Thy 17 inter- 
actions with the middle quartet occurs at ~ 100 ns, 
strongly suggesting that these features are not observable 
in short simulations. During the course of the simulation 
while Thy 17 makes a pentad with the middle quartet, 
Thy5 vacillates to interact with G-quartet stem in 
various ways. When Thy 5 approaches the plane of the 
middle quartet, and both Thy 5 and Thy 17 are simultan- 
eously aligned with the middle quartet, a planar 
T:(GGGG):T hexad is formed (Figure 5d). This motif is 
maintained by simultaneous hydrogen bonding of Thy5 



Nucleic Acids Research, 2013, Vol. 41, No. 4 2729 




0 60 120 180 240 300 360 q 

dA 1 Sugar Pucker (°) 



0 120 180 240 300 

dA 7 Sugar Pucker (°) 



360 



60 120 180 240 300 360 

dA 13 Sugar Pucker (°) 




Adel 




Figure 4. Time development of A:A base pair and A:A:A triad on the terminal quadruplex. Scattergram of glycosidic chi angle versus sugar pucker 
angle of (a) Adel, (b) Ade7 and (c) Adel 3. (d) The syn conformation of Adel and anti conformation of Adel 3 facilitates reverse Watson and Crick 
A:A base pairing on terminal quartet, (e) Ade7 flips to syn conformation and interacts with Adel to form A1:A7:A13 triad. The K + ion is 
represented as purple sphere. 
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Figure 5. Representation of pentad and hexad alignment. The minimum distance plot of (a) Thy5 and Gua3 and (b) Thyl7 and Gual5 from the 
middle quartet. A pentad is formed when either Thy5 or Thy 17 is in plane with the quartet and within hydrogen bonding distance. A hexad is 
formed when both Thy5 and Thy 17 simultaneously align with the middle quartet and form hydrogen bonds. Structural representation of (c) Thy 17 
interaction through its Watson-Crick face via sheared hydrogen bonds with Gual5 to form pentad with middle quartet, (d) Concurrent alignment of 
Thy5 and Thy 17 with middle quartet through Watson-Crick face forms a T:(GGGG):T hexad. The first 300 ns are highlighted as a yellow box, and 
an arbitrary line at 3.5 A is drawn to highlight hydrogen bond formation. The K + ion is represented as purple sphere. 



2730 Nucleic Acids Research, 2013, Vol. 41, No. 4 



with Gua3 and Thy 17 with Gual5, which is sampled for 
^3% over the equilibrated trajectory. 

Role of ions and hydration in maintaining the structure 
of the quadruplex 

This study finds that K + ions remain positioned between 
the stacked G-quartets throughout the course of the simu- 
lation. There was no exchange of cations between the 
G-quartet stem and bulk solvent as observed in other 
quadruplex systems (19,41,42). The formation of the 
Adel:Adel3 base pair is briefly coupled by the position 
of a K + ion (from bulk) between them. Adel and Ade 13 
interact with the ion by forming water bridges with the 
hydration shell around the K + ion. When Ade 13 is within 
bonding distance with Adel, the K + ion leaves, and a 
stable Adel and Ade 13 base pair is formed 
(Supplementary Figure Sll). Additional K + ions stack, 
albeit briefly, between the base pair and the top quartet 
and also above the A:A base pair (Figure 6a and b). 
Similar role of stabilization by ion has also been 
proposed by Maiti and co-workers (40). Analysis of the 
trajectory also reveals the presence of K + ion coordination 
in the loops, which occurs primarily via phosphate 
backbone atoms and solvent molecules. The dynamic 
movement of the backbone phosphate atoms results in 
the generation of a localized electronegative sink. 
The loop captures a K + ion to compensate for the electro- 
static repulsion (Figure 6c). Similar stabilization of the 
loop by a cation has been observed in the crystal structure 
of the cKit-1 quadruplex (51). An extensive spine of hy- 
dration is also observed, similar to those in the crystal 
structures of quadruplexes (Figure 6d) (51,52). The exten- 
sive solvent network mediates interactions between bases 
from the loop and the G-quartet stem. For example, we 
observe that water bridges mediate the binding of Thy 17 
to Gual5 until Thy 17 comes into direct hydrogen bonding 
with Gual5 (Supplementary Figure SI 2). 



Clustering 

Clustering analysis aims to identify the main substates 
sampled during the simulation. A 3.0 A RMSD cut-off 
identified six clusters (Figure 7). The top three clusters 
account for ~86% of the sampled conformational space. 
The simulation is highly heterogeneous, and diverse struc- 
tural scaffolds are observed. In the pink cluster, 7i-stacking 
interactions dominate in loopl and loop3. This cluster 
appears ~300ns and then again towards the end of the 
simulation at M450ns. The conformation of loopl and 
loop3, observed in this pink cluster (at 1481ns), closely 
resembles the loops of the X-ray/NMR structures 
(Figure 8). The red cluster briefly appears within the 
pink cluster, when the Ti-stacking in loopl is lost. Thy6 
faces outward into the solvent. However, this is a 
short-lived conformation, and the loops reform their 
Thy:Ade:Thy stacking geometry quickly. A stable A:A 
base pair and a pentad (Thy 17) contribute to the conform- 
ation adopted in the green cluster. Ade7 in loopl is pos- 
itioned towards the solvent, similar to Adel in the crystal 
structure. The jump observed at ~720ns, in the orange 
cluster, is a result of the formation of the A:A:A triad 
on top of the terminal quartet. The position of Ade7 
stretches and constrains loopl. It places Thy6 in plane 
with the middle quartet. However, the distortions in the 
phosphate backbone prevent Thy6 to form hydrogen 
bonds with the middle quartet. Meanwhile, Thy 17 inter- 
acts with the middle quartet to form a pentad. The orange 
cluster reappears at ~ 1240 ns, highlighting a similar con- 
strained conformation of loopl. The black cluster is 
defined by the formation of A:A base pair and a pentad 
(Thy 17) along with other u-stacking arrangements within 
the loops. In the blue cluster, the interactions of loop3 
with the quartet are lost, and the bases orient towards 
the solvent. The structures observed in the orange and 
blue clusters are transient conformations that lie in the 
more stable black cluster. 




Figure 6. Role of bulk water and ions in stabilization of the topology. Additional K ions positioned (a) between the terminal quartet and the A:A 
diad and (b) on top of the A:A diad. This is analogous to cation bonding between the quartets, (c) K + ion coordination with the backbone atoms and 
water molecules. Ion capture in the loops occurs when an electronegative sink is generated owing to the close proximity of phosphate backbone 
atoms. The ion helps in stabilizing the inter-phosphate repulsion, (d) Spine of hydration as observed in a snapshot during MD simulation. The 
hydration networks are similar to those observed in quadruplex crystal structures. The K + ion is represented as purple sphere, whereas the solvent 
(water) is coloured red. 
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Figure 7. Ensemble of conformations identified via clustering analysis. RMSD-based clustering, with a cut-off of 3.0 A, over equilibriated trajectory 
identified six clusters. Cartoon representations of top and side view of cluster centres are illustrated. The orange and blue clusters are transitional 
substates observed within the black cluster, whereas the red cluster appears within the pink cluster. 



Comparison with experimental data 

The present analysis focuses on the maximally sampled 
base pairing events that modify the topology of the 
quadruplex DNA. Many of these conformations are un- 
precedented with regard to the simulation of intramolecu- 
lar quadruplexes. An important point is that the 
simulation, despite showing a wide range of loop 
geometries, is capable of spontaneously sampling loop 
structures that are similar to those in the currently avail- 
able atomistic experiments. Clustering analysis helped in 
identifying a sub-state (at 1481ns) where the conform- 
ation of loopl and loop3 closely resembles the NMR 
and crystal structure. A superimposition of the identified 
conformation with NMR and crystal structures yielded an 
RMSD of <2.0A (Figure 8). Importantly, the individual 
loops 1 and 3 occasionally also transiently sample such 
geometry in the other parts of the trajectory, which indi- 
cates that the experimental loop geometry, although not 
being the most stable one, is within the low-energy region 
of the simulation. Loop2 is not sampling this geometry, as 
it is constrained by the A1:A13 base pairing. This, 
however, does not a priori mean a disagreement with ex- 
periment. Although Al in the X-ray structure is involved 
in crystal contacts, there is a significant non-equivalence 
between the NMR and simulated sequences, namely, in 
the NMR experiment, the flanking sequence is 5'-TA- 




Figure 8. Comparison of the loops in the (a) simulated structure 
(pink) with the X-ray (blue) and the NMR structure (green). The 
superimposition highlights the similarity between the structures from 
(b) loopl and (c) Loop3. The snapshot from the simulation was 
extracted at 1481ns. 
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instead of 5'-A-. There is thus no free adenine 5'-OH 
group in the NMR structure so that the intramolecular 
hydrogen bond stabilizing the adenine in the syn conform- 
ation cannot form. Owing to this, the adenine in the 
NMR structure is in a high-<2^/ x region, which is 
common in DNA. 

Our study further reports non-canonical base pairing. 
The A:A pair formed by Adel and Adel3 has a Watson- 
Crick Nl hydrogen bond, which has been reported in 
RNA structures such as those in PDB structures id 
1LNG and 1HR2 (53,54). Interactions of Ade7 with the 
backbone of Adel are typical of interactions commonly 
observed in RNA structures (55). Theoretically, an A:A 
pair with Watson-Crick hydrogen bonding can form an 
A:A:A triad only through trans sheared Hoogsteen bonds 
(49). This is the same architecture that is observed in the 
present simulation. A comparable structure with an 
A:A:A triad has also been observed in the 30 S ribosomal 
subunit (PDB id: 1FJG) (56). The T:(GGGG):T hexad, 
which forms briefly when Thy 5 and Thy 17 are simultan- 
eously aligned with the middle quartet, has been observed 
in the structure of a bimolecular quadruplex (PDB id 
1XCE) (57). The Ti-stacking interactions within the loops 
have been observed in our previous simulation of 
quadruplex multimers and are consistent with several ex- 
perimental observations of loop geometries including 
those seen in quadruplex-ligand interactions (20,58,59). 
The frequent sampling of stacking interactions between 
Ade-Thy and Thy-Thy observed in the loops suggests 
that these interactions contribute to stabilizing quadruplex 
geometry. These interactions have been evident in several 
other previous simulations of quadruplex (60). The oc/y 
torsion angles of the loop bases observed in the present 
work are also in agreement with the oc/y angles as observed 
in the crystal structures (22). 



DISCUSSION AND CONCLUSIONS 

The present study provides an unprecedented dynamic 
atomistic picture of the propeller-type loops, covering 
1.5 jis time window with ps-scale time resolution. It is 
fair to admit that the dynamics (mainly balance between 
different substates) may be affected by the approximations 
of the simulation force field, as single-stranded nucleic 
acids topologies are specifically challenging for force- 
field description (23,61,62). This may contribute to the 
fact that the experimental geometry of the loops, albeit 
sampled in the simulations transiently, is not the 
dominant simulation sub-state. Obviously, also the simu- 
lation time scale of the present study is still far from full 
convergence; therefore, we cannot claim that we see 'cor- 
rect' populations of the substates. For both these reasons, 
the relative balance of different structures seen in the 
simulation should not be over-interpreted, although we 
suggest that the simulation likely samples relevant 
substates. However, a clear result of our simulation is 
that the conformational space of the loops is rich and 
more dynamic than we have assumed to date. The simu- 
lation results suggest that the loops may exist as a dynamic 
continuum of interconverting substates, which would be 



difficult to fully capture by available experimental 
methods. In addition, we show that long simulations are 
needed to characterize the quadruplex DNA loop 
dynamics sufficiently exhaustively and without a visible 
bias from the starting structure. Considering just the 
first 300 ns (which is still a much longer trajectory than 
the majority of those analysed in preceding studies) would 
be, regarding the loop dynamics, rather misleading. 

One of the most interesting structural observations here 
is end capping of a quadruplex with the terminal adenine 
base. Such interaction has also been observed in a guanine 
and adenine-rich quadruplex (63). Stacking of a G-quartet 
by a Ade(anti):Ade(syn) base pair is also evident in this 
structure as well as in another bimolecular quadruplex 
(57,63). Interaction of a pyrimidine base with an 
Ade(syn):A(anti) base pair to form a triad has also been 
reported (63). However, to the best of our knowledge, this 
is the first observation of a stable terminal A:A base pair 
and an A:A:A triad in a quadruplex simulation. The 
A:A:A triad with similar bond directionality has been 
observed in an experimental RNA structure (64). The 
current basis for quadruplex-targeted drug design is in 
large part that end stacking on G-quadruplexes is the 
principal mode of interaction for extended heteroaromatic 
ligands (65). The A:A stacks in our simulation corroborate 
with the fact that Ti-stacking interactions over the terminal 
quartet stabilize the quadruplex topology. Also, the A:A 
pairing on the terminal quartet provides a large surface 
area for ligand binding. This enables molecules whose 
chromophores have a larger surface area than, for 
example, a diamido-acridine moiety, to bind to the ends 
of a quadruplex (22). It is, however, fair to admit that in 
our simulations to a certain extent, the A1:A13 interaction 
can be facilitated by the specific 5'-Al single-nucleotide 
end capping, which supports the Al syn conformation 
via the S'-end-specific 5'-OH. . .N3 intramolecular 
H-bond. This adenine is involved in crystal packing in 
the X-ray structure, and in the NMR structure, it is not 
preceeded by any nucleotide so that the intramolecular 
hydrogen bond cannot be formed. 

While exploring the conformational flexibility of the 
TTA loop, neither the pairing of the guanines nor 
the stacking geometry of the quartets was altered by the 
linking topology of the external loops. We observe that 
although the G-quartet stem of the quadruplex remains 
rigid, the bases in the loops can form base-pairing align- 
ments that further stabilize the quadruplex. Higher-order 
pairing alignments such as a pentad, a hexad or a heptad, 
resulting from interactions of adenines with a G-quartet, 
have been observed in the 93del aptamer and other 
quadruplex-forming structures (47,66-68). These align- 
ments are formed when the major groove edge of 
adenine forms sheared trans Hoogsteen bonds with 
guanine in a G-quartet (47,63). Hexad alignment with 
thymine as the interacting base has been reported in the 
NMR solution structure of the bimolecular quadruplex 
d(GCGGTTGGAT) (57). This structure involves the 
Watson-Crick face of a thymine forming sheared 
hydrogen bonds with a guanine in quartet. This inter- 
action effectively perturbs the quadruplex shape from a 
circular towards an expanded toroid architecture (57). 
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The sugar conformations in the hexad, in our simulation, 
remain predominantly in C2 '-endo pucker. A notable ex- 
ception occurs in Thy 17 (Supplementary Figure S9). The 
change in sugar pucker to C3 / -endo in Thy 17 allows it to 
become in-plane with the central G-quartet and 
hydrogen-bond with Gual5. This switch in sugar pucker 
is to avoid severe steric overlap with the sugar in Thy 18. 
All the guanine nucleotides in the quadruplex are in a B x 
conformation except for Gual5 (Supplementary Figures 
S2-S4). The inter-phosphate repulsion is a dominant 
factor that causes a switch from B x to a B n conformation 
(46). The flexibility in Gual5 facilitates Thy 17 interaction 
with the middle quartet and maintains stable interaction 
for much of the simulation time. The 29% sampling of B n 
conformation by Gua3 suggests that the broadening of the 
groove is brief, and therefore, the interaction of Thy5 with 
the middle quartet is not as evident as that of Thy 17. It 
has been suggested that the B x form is intrinsically of 
lower energy than the B n form owing to the more favour- 
able orientation of the phosphodiester backbone 
(46,69-71). However, the energy required to populate 
the B n conformation can be gained from other energy 
contributions such as base stacking or hydration. 

The K + ions sandwiched between the G-quartet are 
retained in antiprismatic bipyramidal coordination. The 
exterior of the quadruplex also coordinates with the ions 
via an extensive water network (Figure 6). Coordination 
networks of a nucleic acid with the solvent are observed 
when non-canonical base pairing play an important role in 
stabilizing tertiary structures (72). In accordance with this, 
we observe that formation of the adenine cap is stabilized 
by a water-K + bridge (Supplementary Figure Sll). 
Furthermore, K+ ions have also been involved in stabil- 
ization of loops. A water bridge is also formed between 
Thy 17 and Gual5, suggesting that solvent plays a role in 
maintaining intra-molecular contacts within the 
quadruplex (Supplementary Figure SI 2). 

The present 1.5 (is simulation of a human telomeric 
quadruplex DNA represents 750 000 frames of structural 
data. Empirical analysis of this vast data set may result in 
the loss of important ensembles. Therefore, the trajectory 
was partitioned into clusters of structures that share 
similar conformational space. The RMSD-based cluster- 
ing analysis groups the trajectory into six distinct con- 
formations. These clusters have been correlated with the 
trajectory after 300 ns to infer the conformational 
pathway in the part of the simulation, which is already 
not visibly affected by the starting configuration 
(Figure 7). The clusters coloured pink, green and black 
have low median values, highlighting metastable regions. 
The orange, blue and green clusters can form within the 
black cluster, suggesting an inter-convertible conform- 
ation. The red cluster observed within the pink cluster, 
which appears both at the start and end of the simulation, 
suggests the presence of a transition state in the metastable 
pink cluster. The clustering analysis makes it clear that 
although the trajectory has equilibrated, it has not yet 
converged, indicating that the conformational sampling 
of loops is not complete. 

The simulation reflects the flexible and dynamic nature of 
loops as observed in shorter timescale simulations. The 



loops during the simulation exhibit conformations that 
resemble those observed in the solution and crystalline 
forms. Amongst the loops, loopl and loop3 transiently 
show arrangement consistent with the crystal/NMR struc- 
ture (Figure 8). Ade-Thy stacking in loop2 is lost as a result 
of the formation of the Adel:Adel3 base pair over the 
terminal quartet. The distinct structural scaffolds that are 
observed in the present simulation have been experimen- 
tally reported in other G-quadruplex and RNA structures. 
However, these conformations have not been reported in a 
human telomeric quadruplex. The ability of loops to 
interact with the G-quartet stem in different ways is a sig- 
nificant feature of the current simulation. This provides 
insight into new conformational topologies and geometries, 
which may be captured and used by drugs designed for 
stabilizing quadruplex in human telomeric regions. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Tables 1 and 2 and Supplementary 
Figures 1-14. 
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